Overview

Dataset statistics

Number of variables13
Number of observations2968
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory301.6 KiB
Average record size in memory104.0 B

Variable types

Numeric13

Alerts

gross_revenue is highly correlated with qtde_invoices and 3 other fieldsHigh correlation
recency_days is highly correlated with qtde_invoicesHigh correlation
qtde_invoices is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qtde_items is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qtde_products is highly correlated with gross_revenue and 3 other fieldsHigh correlation
avg_ticket is highly correlated with avg_unique_basket_sizeHigh correlation
avg_recency_days is highly correlated with frequencyHigh correlation
frequency is highly correlated with avg_recency_daysHigh correlation
avg_basket_size is highly correlated with gross_revenue and 1 other fieldsHigh correlation
avg_unique_basket_size is highly correlated with qtde_products and 1 other fieldsHigh correlation
gross_revenue is highly correlated with qtde_invoices and 1 other fieldsHigh correlation
qtde_invoices is highly correlated with gross_revenue and 2 other fieldsHigh correlation
qtde_items is highly correlated with gross_revenue and 1 other fieldsHigh correlation
qtde_products is highly correlated with qtde_invoicesHigh correlation
avg_ticket is highly correlated with qtde_returns and 1 other fieldsHigh correlation
qtde_returns is highly correlated with avg_ticketHigh correlation
avg_basket_size is highly correlated with avg_ticketHigh correlation
gross_revenue is highly correlated with qtde_invoices and 2 other fieldsHigh correlation
qtde_invoices is highly correlated with gross_revenue and 2 other fieldsHigh correlation
qtde_items is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qtde_products is highly correlated with gross_revenue and 2 other fieldsHigh correlation
avg_recency_days is highly correlated with frequencyHigh correlation
frequency is highly correlated with avg_recency_daysHigh correlation
avg_basket_size is highly correlated with qtde_itemsHigh correlation
gross_revenue is highly correlated with qtde_invoices and 4 other fieldsHigh correlation
qtde_invoices is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qtde_items is highly correlated with gross_revenue and 4 other fieldsHigh correlation
qtde_products is highly correlated with gross_revenue and 3 other fieldsHigh correlation
avg_ticket is highly correlated with qtde_returns and 1 other fieldsHigh correlation
qtde_returns is highly correlated with gross_revenue and 5 other fieldsHigh correlation
avg_basket_size is highly correlated with gross_revenue and 4 other fieldsHigh correlation
avg_unique_basket_size is highly correlated with avg_basket_sizeHigh correlation
avg_ticket is highly skewed (γ1 = 25.15706781) Skewed
frequency is highly skewed (γ1 = 24.87675009) Skewed
qtde_returns is highly skewed (γ1 = 21.9754032) Skewed
df_index has unique values Unique
customer_id has unique values Unique
recency_days has 33 (1.1%) zeros Zeros
qtde_returns has 1481 (49.9%) zeros Zeros

Reproduction

Analysis started2021-12-04 23:54:36.984304
Analysis finished2021-12-04 23:54:57.194336
Duration20.21 seconds
Software versionpandas-profiling v3.1.0
Download configurationconfig.json

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct2968
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2334.939353
Minimum0
Maximum5766
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:57.288322image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile185.35
Q1936.5
median2135.5
Q33563.25
95-th percentile5079.3
Maximum5766
Range5766
Interquartile range (IQR)2626.75

Descriptive statistics

Standard deviation1568.287987
Coefficient of variation (CV)0.6716611224
Kurtosis-1.006531709
Mean2334.939353
Median Absolute Deviation (MAD)1280
Skewness0.3457441718
Sum6930100
Variance2459527.209
MonotonicityStrictly increasing
2021-12-04T20:54:57.400303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
< 0.1%
30321
 
< 0.1%
30171
 
< 0.1%
30181
 
< 0.1%
30211
 
< 0.1%
30221
 
< 0.1%
30231
 
< 0.1%
30241
 
< 0.1%
30271
 
< 0.1%
30291
 
< 0.1%
Other values (2958)2958
99.7%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
ValueCountFrequency (%)
57661
< 0.1%
57471
< 0.1%
57371
< 0.1%
57311
< 0.1%
57101
< 0.1%
57061
< 0.1%
57001
< 0.1%
56891
< 0.1%
56881
< 0.1%
56781
< 0.1%

customer_id
Real number (ℝ≥0)

UNIQUE

Distinct2968
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15270.37702
Minimum12347
Maximum18287
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:57.508302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum12347
5-th percentile12619.35
Q113798.75
median15220.5
Q316768.5
95-th percentile17964.65
Maximum18287
Range5940
Interquartile range (IQR)2969.75

Descriptive statistics

Standard deviation1719.144523
Coefficient of variation (CV)0.1125803587
Kurtosis-1.206178196
Mean15270.37702
Median Absolute Deviation (MAD)1489
Skewness0.03219371129
Sum45322479
Variance2955457.892
MonotonicityNot monotonic
2021-12-04T20:54:57.613338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
178501
 
< 0.1%
126701
 
< 0.1%
177341
 
< 0.1%
149051
 
< 0.1%
161031
 
< 0.1%
146261
 
< 0.1%
148681
 
< 0.1%
182461
 
< 0.1%
171151
 
< 0.1%
166111
 
< 0.1%
Other values (2958)2958
99.7%
ValueCountFrequency (%)
123471
< 0.1%
123481
< 0.1%
123521
< 0.1%
123561
< 0.1%
123581
< 0.1%
123591
< 0.1%
123601
< 0.1%
123621
< 0.1%
123641
< 0.1%
123701
< 0.1%
ValueCountFrequency (%)
182871
< 0.1%
182831
< 0.1%
182821
< 0.1%
182771
< 0.1%
182761
< 0.1%
182741
< 0.1%
182731
< 0.1%
182721
< 0.1%
182701
< 0.1%
182691
< 0.1%

gross_revenue
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2953
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2693.389373
Minimum6.2
Maximum279138.02
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:57.727340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum6.2
5-th percentile229.7325
Q1570.845
median1085.51
Q32306.905
95-th percentile7169.562
Maximum279138.02
Range279131.82
Interquartile range (IQR)1736.06

Descriptive statistics

Standard deviation10135.32607
Coefficient of variation (CV)3.763037818
Kurtosis397.3184084
Mean2693.389373
Median Absolute Deviation (MAD)671.39
Skewness17.63574461
Sum7993979.66
Variance102724834.5
MonotonicityNot monotonic
2021-12-04T20:54:57.836304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1078.962
 
0.1%
2053.022
 
0.1%
3312
 
0.1%
1353.742
 
0.1%
889.932
 
0.1%
745.062
 
0.1%
379.652
 
0.1%
2092.322
 
0.1%
731.92
 
0.1%
734.942
 
0.1%
Other values (2943)2948
99.3%
ValueCountFrequency (%)
6.21
< 0.1%
13.31
< 0.1%
151
< 0.1%
36.561
< 0.1%
451
< 0.1%
521
< 0.1%
52.21
< 0.1%
52.21
< 0.1%
62.431
< 0.1%
68.841
< 0.1%
ValueCountFrequency (%)
279138.021
< 0.1%
259657.31
< 0.1%
194550.791
< 0.1%
140438.721
< 0.1%
124564.531
< 0.1%
117375.631
< 0.1%
91062.381
< 0.1%
72882.091
< 0.1%
66653.561
< 0.1%
65019.621
< 0.1%

recency_days
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct272
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64.31030997
Minimum0
Maximum373
Zeros33
Zeros (%)1.1%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:57.943302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q111
median31
Q381
95-th percentile242
Maximum373
Range373
Interquartile range (IQR)70

Descriptive statistics

Standard deviation77.76031378
Coefficient of variation (CV)1.209142264
Kurtosis2.77659321
Mean64.31030997
Median Absolute Deviation (MAD)26
Skewness1.79807024
Sum190873
Variance6046.666399
MonotonicityNot monotonic
2021-12-04T20:54:58.053302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
199
 
3.3%
487
 
2.9%
285
 
2.9%
385
 
2.9%
876
 
2.6%
1067
 
2.3%
966
 
2.2%
766
 
2.2%
1764
 
2.2%
2255
 
1.9%
Other values (262)2218
74.7%
ValueCountFrequency (%)
033
 
1.1%
199
3.3%
285
2.9%
385
2.9%
487
2.9%
543
1.4%
766
2.2%
876
2.6%
966
2.2%
1067
2.3%
ValueCountFrequency (%)
3732
0.1%
3724
0.1%
3711
 
< 0.1%
3681
 
< 0.1%
3664
0.1%
3652
0.1%
3641
 
< 0.1%
3601
 
< 0.1%
3591
 
< 0.1%
3584
0.1%

qtde_invoices
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct56
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.724056604
Minimum1
Maximum206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:58.382306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile17
Maximum206
Range205
Interquartile range (IQR)4

Descriptive statistics

Standard deviation8.857882575
Coefficient of variation (CV)1.5474834
Kurtosis190.7771511
Mean5.724056604
Median Absolute Deviation (MAD)2
Skewness10.76520644
Sum16989
Variance78.46208371
MonotonicityNot monotonic
2021-12-04T20:54:58.499303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2785
26.4%
3498
16.8%
4393
13.2%
5237
 
8.0%
1190
 
6.4%
6173
 
5.8%
7138
 
4.6%
898
 
3.3%
969
 
2.3%
1055
 
1.9%
Other values (46)332
11.2%
ValueCountFrequency (%)
1190
 
6.4%
2785
26.4%
3498
16.8%
4393
13.2%
5237
 
8.0%
6173
 
5.8%
7138
 
4.6%
898
 
3.3%
969
 
2.3%
1055
 
1.9%
ValueCountFrequency (%)
2061
< 0.1%
1991
< 0.1%
1241
< 0.1%
971
< 0.1%
912
0.1%
861
< 0.1%
721
< 0.1%
622
0.1%
601
< 0.1%
571
< 0.1%

qtde_items
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1664
Distinct (%)56.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1579.712264
Minimum1
Maximum196844
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:58.626304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile101.35
Q1296
median638
Q31398.25
95-th percentile4403.25
Maximum196844
Range196843
Interquartile range (IQR)1102.25

Descriptive statistics

Standard deviation5700.529956
Coefficient of variation (CV)3.608587516
Kurtosis518.1228414
Mean1579.712264
Median Absolute Deviation (MAD)419
Skewness18.7602581
Sum4688586
Variance32496041.78
MonotonicityNot monotonic
2021-12-04T20:54:58.738330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31011
 
0.4%
889
 
0.3%
1509
 
0.3%
2608
 
0.3%
848
 
0.3%
2888
 
0.3%
2728
 
0.3%
2468
 
0.3%
5167
 
0.2%
3947
 
0.2%
Other values (1654)2885
97.2%
ValueCountFrequency (%)
11
< 0.1%
22
0.1%
122
0.1%
161
< 0.1%
171
< 0.1%
181
< 0.1%
191
< 0.1%
201
< 0.1%
231
< 0.1%
251
< 0.1%
ValueCountFrequency (%)
1968441
< 0.1%
799631
< 0.1%
773731
< 0.1%
699931
< 0.1%
645491
< 0.1%
641241
< 0.1%
628121
< 0.1%
582431
< 0.1%
577851
< 0.1%
502551
< 0.1%

qtde_products
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct469
Distinct (%)15.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.7456199
Minimum1
Maximum7837
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:58.861304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9
Q129
median67
Q3135
95-th percentile382
Maximum7837
Range7836
Interquartile range (IQR)106

Descriptive statistics

Standard deviation269.8785162
Coefficient of variation (CV)2.198681439
Kurtosis354.7550751
Mean122.7456199
Median Absolute Deviation (MAD)44
Skewness15.70464041
Sum364309
Variance72834.41353
MonotonicityNot monotonic
2021-12-04T20:54:58.983305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2845
 
1.5%
2038
 
1.3%
3535
 
1.2%
1533
 
1.1%
2933
 
1.1%
1933
 
1.1%
1132
 
1.1%
2631
 
1.0%
2730
 
1.0%
2529
 
1.0%
Other values (459)2629
88.6%
ValueCountFrequency (%)
16
 
0.2%
214
0.5%
315
0.5%
417
0.6%
526
0.9%
629
1.0%
718
0.6%
819
0.6%
927
0.9%
1027
0.9%
ValueCountFrequency (%)
78371
< 0.1%
56701
< 0.1%
50951
< 0.1%
45771
< 0.1%
26981
< 0.1%
23791
< 0.1%
20601
< 0.1%
18181
< 0.1%
16731
< 0.1%
16361
< 0.1%

avg_ticket
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct2965
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.99655282
Minimum2.150588235
Maximum4453.43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:59.113306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum2.150588235
5-th percentile4.915887985
Q113.11811111
median17.96548505
Q324.98179365
95-th percentile90.052125
Maximum4453.43
Range4451.279412
Interquartile range (IQR)11.86368254

Descriptive statistics

Standard deviation119.5318165
Coefficient of variation (CV)3.622554671
Kurtosis812.969606
Mean32.99655282
Median Absolute Deviation (MAD)5.980669355
Skewness25.15706781
Sum97933.76878
Variance14287.85517
MonotonicityNot monotonic
2021-12-04T20:54:59.240305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
152
 
0.1%
4.1622
 
0.1%
14.478333332
 
0.1%
18.152222221
 
< 0.1%
13.927368421
 
< 0.1%
36.244117651
 
< 0.1%
29.784166671
 
< 0.1%
22.87926231
 
< 0.1%
20.511041671
 
< 0.1%
149.0251
 
< 0.1%
Other values (2955)2955
99.6%
ValueCountFrequency (%)
2.1505882351
< 0.1%
2.43251
< 0.1%
2.4623711341
< 0.1%
2.5112413791
< 0.1%
2.5153333331
< 0.1%
2.651
< 0.1%
2.6569318181
< 0.1%
2.7075982531
< 0.1%
2.7606215721
< 0.1%
2.7704641911
< 0.1%
ValueCountFrequency (%)
4453.431
< 0.1%
3202.921
< 0.1%
1687.21
< 0.1%
952.98751
< 0.1%
872.131
< 0.1%
841.02144931
< 0.1%
651.16833331
< 0.1%
6401
< 0.1%
624.41
< 0.1%
615.751
< 0.1%

avg_recency_days
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct1258
Distinct (%)42.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67.30505288
Minimum1
Maximum366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:59.356337image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8
Q125.9271978
median48.26785714
Q385.33333333
95-th percentile200.65
Maximum366
Range365
Interquartile range (IQR)59.40613553

Descriptive statistics

Standard deviation63.50325927
Coefficient of variation (CV)0.9435139941
Kurtosis4.908645262
Mean67.30505288
Median Absolute Deviation (MAD)26.26785714
Skewness2.06622239
Sum199761.397
Variance4032.663938
MonotonicityNot monotonic
2021-12-04T20:54:59.470302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1425
 
0.8%
422
 
0.7%
7021
 
0.7%
720
 
0.7%
3519
 
0.6%
4918
 
0.6%
1117
 
0.6%
4617
 
0.6%
2117
 
0.6%
2816
 
0.5%
Other values (1248)2776
93.5%
ValueCountFrequency (%)
116
0.5%
1.51
 
< 0.1%
213
0.4%
2.51
 
< 0.1%
2.6013986011
 
< 0.1%
315
0.5%
3.3214285711
 
< 0.1%
3.3303571431
 
< 0.1%
3.52
 
0.1%
422
0.7%
ValueCountFrequency (%)
3661
 
< 0.1%
3651
 
< 0.1%
3631
 
< 0.1%
3621
 
< 0.1%
3572
0.1%
3561
 
< 0.1%
3552
0.1%
3521
 
< 0.1%
3512
0.1%
3503
0.1%

frequency
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct1225
Distinct (%)41.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1138262908
Minimum0.005449591281
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:59.586331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0.005449591281
5-th percentile0.008893504781
Q10.01633986928
median0.02589835169
Q30.04942659085
95-th percentile1
Maximum17
Range16.99455041
Interquartile range (IQR)0.03308672157

Descriptive statistics

Standard deviation0.4082214549
Coefficient of variation (CV)3.586354717
Kurtosis989.0590635
Mean0.1138262908
Median Absolute Deviation (MAD)0.0121968864
Skewness24.87675009
Sum337.8364311
Variance0.1666447562
MonotonicityNot monotonic
2021-12-04T20:54:59.702306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1198
 
6.7%
0.0277777777817
 
0.6%
0.062517
 
0.6%
0.0238095238116
 
0.5%
0.0909090909115
 
0.5%
0.0833333333315
 
0.5%
0.0344827586214
 
0.5%
0.0294117647114
 
0.5%
0.0357142857113
 
0.4%
0.0769230769213
 
0.4%
Other values (1215)2636
88.8%
ValueCountFrequency (%)
0.0054495912811
 
< 0.1%
0.0054644808741
 
< 0.1%
0.0054794520551
 
< 0.1%
0.0054945054951
 
< 0.1%
0.0055865921792
0.1%
0.0056022408961
 
< 0.1%
0.0056179775282
0.1%
0.005665722381
 
< 0.1%
0.0056818181822
0.1%
0.0056980056983
0.1%
ValueCountFrequency (%)
171
 
< 0.1%
31
 
< 0.1%
26
 
0.2%
1.1428571431
 
< 0.1%
1198
6.7%
0.751
 
< 0.1%
0.66666666673
 
0.1%
0.5508021391
 
< 0.1%
0.53351206431
 
< 0.1%
0.53
 
0.1%

qtde_returns
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct213
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.88847709
Minimum0
Maximum9014
Zeros1481
Zeros (%)49.9%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:54:59.825306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q39
95-th percentile100
Maximum9014
Range9014
Interquartile range (IQR)9

Descriptive statistics

Standard deviation282.864784
Coefficient of variation (CV)8.107685048
Kurtosis596.2019916
Mean34.88847709
Median Absolute Deviation (MAD)1
Skewness21.9754032
Sum103549
Variance80012.48604
MonotonicityNot monotonic
2021-12-04T20:54:59.937307image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01481
49.9%
1164
 
5.5%
2148
 
5.0%
3105
 
3.5%
489
 
3.0%
678
 
2.6%
561
 
2.1%
1251
 
1.7%
743
 
1.4%
843
 
1.4%
Other values (203)705
23.8%
ValueCountFrequency (%)
01481
49.9%
1164
 
5.5%
2148
 
5.0%
3105
 
3.5%
489
 
3.0%
561
 
2.1%
678
 
2.6%
743
 
1.4%
843
 
1.4%
941
 
1.4%
ValueCountFrequency (%)
90141
< 0.1%
80041
< 0.1%
44271
< 0.1%
37681
< 0.1%
33321
< 0.1%
28781
< 0.1%
20221
< 0.1%
20121
< 0.1%
17761
< 0.1%
15941
< 0.1%

avg_basket_size
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1972
Distinct (%)66.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean235.7885065
Minimum1
Maximum6009.333333
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:55:00.058307image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile44
Q1103.2375
median172
Q3281.375
95-th percentile598.345
Maximum6009.333333
Range6008.333333
Interquartile range (IQR)178.1375

Descriptive statistics

Standard deviation283.7237528
Coefficient of variation (CV)1.203297637
Kurtosis103.0742725
Mean235.7885065
Median Absolute Deviation (MAD)82.625
Skewness7.717538936
Sum699820.2873
Variance80499.16789
MonotonicityNot monotonic
2021-12-04T20:55:00.185305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10011
 
0.4%
11410
 
0.3%
739
 
0.3%
869
 
0.3%
829
 
0.3%
888
 
0.3%
758
 
0.3%
608
 
0.3%
1368
 
0.3%
1307
 
0.2%
Other values (1962)2881
97.1%
ValueCountFrequency (%)
12
0.1%
21
< 0.1%
3.3333333331
< 0.1%
5.3333333331
< 0.1%
5.6666666671
< 0.1%
6.1428571431
< 0.1%
7.51
< 0.1%
91
< 0.1%
9.51
< 0.1%
111
< 0.1%
ValueCountFrequency (%)
6009.3333331
< 0.1%
42821
< 0.1%
39061
< 0.1%
3868.651
< 0.1%
28801
< 0.1%
28011
< 0.1%
2733.9444441
< 0.1%
2518.7692311
< 0.1%
2160.3333331
< 0.1%
2082.2258061
< 0.1%

avg_unique_basket_size
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct910
Distinct (%)30.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.49039145
Minimum0.2
Maximum259
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.3 KiB
2021-12-04T20:55:00.328306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile2
Q17.666666667
median13.6
Q322.03571429
95-th percentile46
Maximum259
Range258.8
Interquartile range (IQR)14.36904762

Descriptive statistics

Standard deviation15.4620774
Coefficient of variation (CV)0.8840326672
Kurtosis29.30304319
Mean17.49039145
Median Absolute Deviation (MAD)6.6
Skewness3.434441407
Sum51911.48183
Variance239.0758377
MonotonicityNot monotonic
2021-12-04T20:55:00.480304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1343
 
1.4%
942
 
1.4%
1641
 
1.4%
839
 
1.3%
1737
 
1.2%
1437
 
1.2%
736
 
1.2%
1136
 
1.2%
534
 
1.1%
1534
 
1.1%
Other values (900)2589
87.2%
ValueCountFrequency (%)
0.21
 
< 0.1%
0.253
 
0.1%
0.33333333336
0.2%
0.41
 
< 0.1%
0.40909090911
 
< 0.1%
0.512
0.4%
0.54545454551
 
< 0.1%
0.55555555561
 
< 0.1%
0.57142857141
 
< 0.1%
0.61764705881
 
< 0.1%
ValueCountFrequency (%)
2591
< 0.1%
1771
< 0.1%
1481
< 0.1%
1271
< 0.1%
1051
< 0.1%
1041
< 0.1%
1011
< 0.1%
981
< 0.1%
95.51
< 0.1%
94.333333331
< 0.1%

Interactions

2021-12-04T20:54:55.571336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:39.755305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.148306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.445338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.674304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.032331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.222336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.526331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.938339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.171336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.462336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.041303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.300331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.668336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:39.877304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.245332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.542338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.772304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.121304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.324342image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.623343image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.027338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.269331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.572336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.141302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.396336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.760331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:39.972303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.344320image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.632330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.969339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.205336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.423304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.848343image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.118338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.365331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.671303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.235303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.496339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.853303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.068342image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.442340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.721331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.066303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.295302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.522339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.943330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.204336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.461331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.776338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.334303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.593302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.952332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.168319image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.548303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.815337image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.167336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.383302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.627303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.048302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.308303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.558338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.897303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.433303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.690331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.039305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.266305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.643303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.900336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.259305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.471303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.727336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.143339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.405304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.654304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.993303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.522304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.779346image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.137331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.372340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.752320image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.001303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.371302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.576302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.836311image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.248337image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.508304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.757336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.116304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.628304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.885304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.239336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.478341image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.858305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.101338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.477340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.676336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.946303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.356330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.616305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.856336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.231304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.733332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.991331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.334336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.569302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.954304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.190338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.563330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.761337image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.040336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.451331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.704305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.950331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.342305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.827338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.078305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.427330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.766304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.051305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.293332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.660332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.855340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.138338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.550330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.803304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.054303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.453305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:53.929339image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.179338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.525331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.866306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.152307image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.395303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.756332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:45.954341image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.241303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.650330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.907306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.160324image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.743304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.022336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.285305image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.612303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:40.957304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.244303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.481336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.843330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.039341image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.337331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.742331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:49.994303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.256335image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.840303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.113336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.375331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:56.705331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:41.054331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:42.345303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:43.579302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:44.940337image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:46.132303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:47.433336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:48.843303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:50.086301image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:51.357303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:52.940302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:54.209302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-12-04T20:54:55.471330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Correlations

2021-12-04T20:55:00.617306image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-12-04T20:55:00.808304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-12-04T20:55:01.000332image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-12-04T20:55:01.198304image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2021-12-04T20:54:56.887336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
A simple visualization of nullity by column.
2021-12-04T20:54:57.097342image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

df_indexcustomer_idgross_revenuerecency_daysqtde_invoicesqtde_itemsqtde_productsavg_ticketavg_recency_daysfrequencyqtde_returnsavg_basket_sizeavg_unique_basket_size
00178505391.21372.0034.001733.00297.0018.1535.5017.0040.0050.970.62
11130473232.5956.009.001390.00171.0018.9027.250.0335.00154.4411.67
22125836705.382.0015.005028.00232.0028.9023.190.0450.00335.207.60
3313748948.2595.005.00439.0028.0033.8792.670.020.0087.804.80
4415100876.00333.003.0080.003.00292.008.600.0722.0026.670.33
55152914623.3025.0014.002102.00102.0045.3323.200.0429.00150.144.36
66146885630.877.0021.003621.00327.0017.2218.300.06399.00172.437.05
77178095411.9116.0012.002057.0061.0088.7235.700.0341.00171.423.83
881531160767.900.0091.0038194.002379.0025.544.140.24474.00419.716.23
99160982005.6387.007.00613.0067.0029.9347.670.020.0087.574.86

Last rows

df_indexcustomer_idgross_revenuerecency_daysqtde_invoicesqtde_itemsqtde_productsavg_ticketavg_recency_daysfrequencyqtde_returnsavg_basket_sizeavg_unique_basket_size
29585678177271060.2515.001.00645.0066.0016.066.001.006.00645.0066.00
2959568817232421.522.002.00203.0036.0011.7112.000.150.00101.5015.00
2960568917468137.0010.002.00116.005.0027.404.000.400.0058.002.50
2961570013596697.045.002.00406.00166.004.207.000.250.00203.0066.50
29625706148931237.859.002.00799.0073.0016.962.000.670.00399.5036.00
2963571012479473.2011.001.00382.0030.0015.774.001.0034.00382.0030.00
2964573114126706.137.003.00508.0015.0047.083.000.7550.00169.334.67
29655737135211092.391.003.00733.00435.002.514.500.300.00244.33104.00
2966574715060301.848.004.00262.00120.002.521.002.000.0065.5020.00
2967576612558269.967.001.00196.0011.0024.546.001.00196.00196.0011.00